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Abstract 

The first cDNA from the Photurinae subfamiiy of the Lampyridae encoding a firefly luciferase from lantern mRNA of 
Photuris pennsylvanica has been cloned, sequenced, the amino-acid sequence predicted and the sequence reported to 
GenBank. The cDNA was about 1.8 kb in length with the largest open reading frame coding for a 545-residue protein. The 
5' noncoding region is 61 bp long and the 3' noncoding region is 135 bp in length. There is a 24-nucleotide poly(A) tail. 
When the amino-acid residues are aligned. P. pennsylvanica contains 154 (about 28% of the total residues) that are 
conserved in all 16 of the deduced luciferase sequences that are presently available. In this P. pennsylvanica luciferase. the 
amino acids ac 276 of the positions are the same at corresponding positions of at least one of the. other enzymes. There are 
two amino-acid differences between this luciferase and the unpublished sequence obtained by Dr. Keith Wood for a putative 
larval Phontris firefly luciferase cloned from a Maryland firefly. Signature amino-acid sequences and domains found in the 
deduced sequence are for adenylate kinase, the putative AiVlP -binding domain, luciferin 4-mcnooxygenase, 4-coumarate 
CoA ligase, long-chain fatty acid CoA ligase, 2-acylglycerophosphoethanolamine acy I transferase, the micro body-directing 
sequence, pepude-synthesizing complexes, and acyladenylate-synthesizing enzymes. 

Keywords: Luciferase; Bioluminescence; Luciferin; Primary structure; {P. pennsylvanica); (Firefly.) 



1. Introduction 

Firefly luciferase (Photimis luciferin: oxygen 4- 
oxidoreductase. EC 1.13.12.7, abbreviated LUC) pro- 
duces light by the oxidative decarboxylation of lu- 
ciferin (LH 2 ) as shown in the following equations 
that represent the two-step reaction. 



" Corresponding author. Fax: -r- I (405) 7447799; E-mail: 
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Step one forms an enzyme-bound luciferyi adeny- 
late 

LH, 4- MgATP 4- LUC ^ LUC-LH 2 -AMP 4- MgPP ( 

Step two is the oxidative decarboxylation of lu- 
ciferin with the production of light upon decay of the 
excited form of oxyluciferin 

LUC-LH 2 -AMP 4- Q 2 4- OH" 

LUC-OL 4- CO. 4- AMP 4- light 4- H ? 0 

There is a slow release of the oxyluciferin product. 
OL, from the enzyme -product complex. 

Tne enzymatic reaction has a quantum yield of 
0.88 photon/molecule of luciferin oxidized [1]. The 
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enzyme is widely used in the quantitation of ATP and 
other biochemically important compounds [2-5] and 
as reporter of gene expression (see bibliographic lists 
in J. Biolumin. Chemilumin. 5, 141-152 (1990) and 
8, 267-291 (1993)). 

The gene for firefly luciferase of the North Ameri- 
can firefly, Photinus pyralis^ was cloned and se- 
quenced by DeLuca and colleagues [6,7]. Since those 
studies, firefly luciferase cDNAs or genes have now 
been cloned from several other beetle species; these 
are listed in Table 1. This paper reports the cloning 
and sequencing of a cDNA for firefly luciferase- from 
Photuris pennsyluanica', this is the first species from 
the Photurinae subfamily to have its luciferase se- 
quence determined and reported to GenBank. Pho- 
turis pennsyluanica is a twilight/ night- active firefly 
while the common, weil-characterized North Ameri- 
can species, P. pyralis, is dusk-active, flashing only 
during twilight. Before mating, Photuris fireflies fe- 
males respond to courtship signals of conspecific 
males [15]. After mating, they become "femmes fa- 
tales' by answering the courtship flashes of males of 
other species who are then treated as prey. Predation 
by aggressive mimicry is known only for Photuris 
[161 

The Photuris genus has been mainly studied from 
the biological standpoint. The biochemical and struc- 
tural changes that occur during light organ develop- 
ment have been studied by Strause et al. [17]. During 
development the larval light organ regresses and is 
replaced by the adult lantern. During pupation the 
levels of luciferase and luciferin remain constant in 
the posterior half of the pupa while there is an initial 
increase followed by a decrease of luciferase and 
luciferin in the anterior half. Strause and DeLuca [18] 
found a luciferase isozyme in larval Photuris penn- 
syluanica that is distinct from the enzyme of the 
adult. This laboratory has identified two firefly lu- 
ciferases from adult P. pennsyluanica lanterns during 
Sephadex G-150 chromatography [14]. 



Fig. 1. Nucleotide and predicted amino acids sequences for 
cDNA from Photuris pennsyluanica. The 6 1 bp leader is shown 
in groups of 10 bp with a residual. The coding sequence starts at 
bp 62 and continues through bp 1700. The 5' and 3' noncoding 
region is shown in groups of 10 bp. The figure was prepared by 
editing a DNA Sender report after the consensus sequence was 
derived using the Assembly LIGN program. 



gaeacgagee A ga cvyji cc gcacagrecg acitauuc ceaaageeac aagecgaaac a 
1/1 3 IV II 

a eg gaa gac aaa uc acc cea cac gga cee gaa ecu tct cac coc eey gee gac ggg acg 
61/21 91/31 

get can gan cat* ace tee ue gea cea eee cca eac gca gae ace cca gea cee aee gea 
AGSQ:<?rALSRYA = I S G C I A 
121/41 1*1/51 

ccg aca aac gee cat aca aaa gaa aac get tea tac gaa caa tec eta aaa ccg teg tgc 
181/61 2U/71 

cgc tea geg; gaa age zzz aaa oac -ac gga tea aaa caa aac gac aca aca geg. geg cgc 
RLASSFXXYG^KQrJD'riAVC 
241/81 271/91 

age gaa aac gge zzz caa zzz zzc eee cee cca ace gca tea teg tac cee gga aca acc 
s £ c i q r i ? l : a 5 i v - c. : : 

301/101 331/111 

gca gca czz gee age gac aaa cac ace gaa cgc gaa tea aca cac age cee gee ace gea 
a a ? v s d :-: y r z ?. £ i : :-: s l g : v 

361/121 291/121 

aaa cca cgc aca ace zzz ege tee aa? aac ace tee eaa aaa gea ceg aac gea aaa tee 

:< ? ?. i : f c s :< m t ? o :■: v 1 m v x 3 

421/141 iSl/lSI 

aaa. tea aaa cac gea gaa ace ace ace aca tea gac tea. sac gaa gac tea gga gge tat 

JC L X Y V 3 T I II LD-IISD^GG'f 
.181/151 =11/171 

caa tgc cec aac aac tee ace tee caa aac tee gac ace aac cee gac gea aag aaa tee 

q c t. :j :: ? I s q h s 3 : l d ■/ x k • r 

541/131 571/191 

aaa cca aac tee tec aac cca. gac gac cag; gee geg ttg gea acg cee tee tee ggc aca 

:< ? :i 

SOI/201 
acc gee gee 

T G V 
651/221 

egc aaa gac cee ace tec ggc aac gea ace aac tea acg aca gca ace 

cx3?r?GiJA::J?T7A:i.T v. 

721/241 "1/251 

cca tee cac cac gge tee ggn acg ace acc aca tea gga tac tee ace tge gga -.ce^cga 
?FKK3?GMTTTi.GY?TCGFa 
791/251 511/271 

gee get cca acg cac acg eee gaa gaa aaa tea tee tea caa tea tea eaa gac tat aaa 

v a l r r s s :< - r l c 3 i g ^ 7 -< 

341/281 571/251 

geg gaa age act tea tee gea cca aca tta atg gea tte tte cca aaa ace geg tea gee 



531/211 

et aag gga gee acg cea ace cac aa*? i 

X G V M L 7 H S : 
£91/221 



. aca gea aca 



301/301 

gaa aac tac gac tea 



: ggc gea '. 



221/j-i 
cac tea aaa gaa ate gca eee 

H L X 3 Z A 3 " 3 A ~ L Z i. 
551/321 391/32: 

gaa aee ggg gag itg geg aaa aaa egg tee iaa tea aac tee gee acg eaa ggg tac gga 
1321/ 2 ft-/ " " : " :51 T2=1 

tea aca gaa acc ace teg gee gee tea ace aca ceg gac ace gac gee aga ccg gga tea 
Z. T I T 7 3 A V -„ Z 7 ? T 7 0 V ?. ? G ^ 
1281/:=! 1111/371 

ace ggc aaa. aca gea cea tee cac gee gee aaa gee geg gac eee aca aca gga xaa aee 

T G X I V ? F K A V X V V 3 ? ? 7 G K 2 
1141/331 1171/291 

eeg ggff cca aac gaa ace gga gaa teg tac tee aaa ggc gac acg aca acg aaa ace tac 

LGPSSTCZ-Yrr. CSMZMitS'i- 
1201/401 1231/411 

tac aac aac gaa gaa gee ace aaa gca ace ace aac aaa gac gga tgg teg cgc - — J3 - 

y :j m s z a 7 :< a : - m x d g w i ?. s g 

1251/421 L291/431 

gac ace gee eac tac gac aac gac ggc cac tee eae aee gee? gac agg ceg aag tea tea 
D 1 A Y V 2 3 G I-i T Y I V 3 ?. 1 X 3 1 

1321/441 13S1/451 

ace aaa tac aaa gge tat cag gee gca cee gee gaa ace gag/ gga aca cee tea caa tac 

Z X Y X G Y Q 7 A ? A Z Z S G I 1 - Q 3 
1381/461 1411/471 

ceg eac acc gee gac gee ggc gee ace gge aca ccg gac gaa gec geg ggc gag -_e tea 
?YI^3AGV7C 1 P03AAGS-? 
1441/481 1471/491 

gee gca gge gee gea gea cag ace gga aaa eac cea aac gaa caa acc gea caa aac tee 

A A G V V V Q T G X Y L N 3 Q 1 V G H r 
1301/501 1531/511 

gee tee ace caa gee tea aca gee aaa tgg cea egc gge ggg- geg aaa tee teg gac gaa 
V S S Q V S T A X V/ £* S, 3 G V X ? — 3 £ 
1561/521 1591/531 

ace eee aaa. gea tea are gea aaa ace gac aca aaa geg cea aga eaa, aeg tee gaa aaa 

I ? X G 3 7 G X 2 2 rl X V L ?. Q M ? 2 •< 
1521/541 



ceaogeeaec taceeaceea tceacacgre acceaaceac geeegaceea eaaceeaeee 
eacaaceace eaceeeeeeg aceeeacege ccaceaaaag tgge^egeaa eia) M 



/ 



42 



U Ye et at. / Biochimica et BiophysicaAcza 1339(1997) 39-52 



The overall goal of this study is to increase knowl- 
edge concerning amino acids that might function in 
catalyzing light production by luciferin oxidation by 
obtaining an unique sequence from another subfamily 
of fireflies. Comparison of the amino-acid sequences 
of several firefly luciferases may indicate which 
amino acids are functionally important since these 
should be conserved among the various species. The 
sequence of P. pennsylvanica iuciferase has been 
deposited in the GenBank as entry U31240. 

2. Materials and methods 

2.L Bacterial strains and media 

E. coli strain XLl-Blue MRF' (Stratagene Cat. # 
200301) was used for AZAP Uni-Zap™ library plat- 
ing. E. coli strain XLl-Blue (Stratagene Cat. # 
200268) was used for amplification of AZAP Uni- 
Zap™ library and for recombinant plasmid replication 
and preparation. E. coli strain SOLR™ (Stratagene 
Cat. # 200298) was used as a plasmid host for in 
vivo excision and cDNA library screening. 

2.2. Firefly collection 

Fireflies were collected locally with nets and stored 
in jars until the evening's collection was completed. 
The live fireflies were taken to the laboratory, sorted 
according to species, and immediately frozen in liq- 
uid nitrogen. The frozen fireflies were stored in a 
- 80°C freezer. 

2.3. mRNA isolation and library preparation 

The lanterns of the frozen P. pennsylvanica fire- 
flies were removed under liquid nitrogen and then 
one g of lanterns was pulverized under liquid nitro- 
gen with a mortar and pestle. The lantern RNA was 
isolated by using a Stratagene RNA Isolation Kit 



(Cat. # 200345,. Lot 138) and then the mRNA was 
isolated using Stratagene's Quik™ mRNA Isolation 
Kit (Cat. # 200349, lot 133). A Stratagene ZAP- 
cDNA™ Synthesis Kit (Cat. # 200400, Lot UC122) 
was used for construction of the cDNA library using 
5 jj.g mRNA. In vivo excision of pBluescript® 
phagemid from Uni-ZAP™ XR (Stratagene ZAP- 
cDNA™ Synthesis Kit) was done according to the 
instructions provided to produce a phagemid. 

2.4. Expression 

Expression was achieved by following a procedure 
similar to that of Devine et al. [11]. The instructions 
supplied with Stratagene's ExAssist™ kit were fol- 
lowed for excision. Twelve 150 mm plates were used 
for the first screening and incubated at 37°C overnight. 
The colonies on each individual plate were lifted onto 
a nitrocellulose filter and the filters were placed on 
fresh LB-ampicillin plates. The master plates and 
lifted filters were incubated at 37°C for another 2 h. 
The filters with colonies were switched to 22°C and 
incubated for another 2 h .for expression. The filters 
were removed from the plates, soaked with 1 mM 
firefly luciferin (in 0.1 M Tris acetate, 10 mM 
MgSO^, 2 mM EDTA ? pH 5.0) for 5 min, wrapped 
with Saran Wrap ® and exposed to X-ray film 
overnight. After film development, the positive 
colonies (bioluminescent) were identified (they also 
could be observed, by eye after dark-adaptation). Two 
colonies were picked from the master plates, and 
streaked onto fresh LB-ampicillin plates. Positive 
colonies were purified by two more rounds of plating 
and screening. 

2.5. Sequencing 

The pBluescript® phagemid was used for sequenc- 
ing. DNA sequencing was done by the Sarkeys 
Biotechnology Laboratory in this Department. The 
sequencing was started using T3 and T7 primers. 



Fig. 2. Comparison of the amino-acid sequences from sixteen firefly luciferases. An alignment of the amino-acid residues from the 
sixteen firefly Iuciferase listed in Table 1 was prepared. The shading used indicate: ■ an amino-acid residue conserved in ail 16 
luciferases; H a residue found in Photuris pennsylvanica and at least one other firefly Iuciferase sequence; and a gap in the sequence 
used for alignment of the residues. The alignment was handmade and checked by comparison to an alignment made using the MACAW, 
and Clustal programs. 
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Subsequent sequencing was continued in bothdirec- 
tions using primers designed using the Oligo 4.06 
program (National Biosciences, Inc.) based on the 
determined sequence. The primers were synthesized 
by the Sarkeys Biotechnology Laboratory. 

2.6. Measurement of iuciferase gene size 

The plasmid containing the Iuciferase gene was 
digested with EcoR I and Xhol at 37°C overnight. A 
separate digestion was done using 5am HI and Kpnl 
under the same conditions. The digested samples 
were separated by 1% agarose gel electrophoresis 
with a 1 kb DNA ladder (BRL, Cat. No. 1561057) as 
a standard and uncut plasmid as a. control. 

2. 7. Computer analysis of data 

The following programs and data bases (versions) 
were used: AssemblyLIGN, V 1.0.7; Beauty [19], 
BLAST? AT, and various other search algorithms via 
the BCM Search Launcher of the Human Genome 
Center at the Bavlor College of Medicine, Houston 
TX;. Blocks [20,21]; DNA Strider, V1.2 [22]; Gen- 
Bank, NCBI, release 89.0 [23]; MacVector, V 4.5.2 
[24]- p//MW calculated at the ExPASy Server; 
ProDom. release 28 [25,26], ProSite [27]; PROSITE, 
release 13 [27]; and SWISS-PROT, release 21.0 [28]. 



3. Results 

3.1. Isolation and sequencing of cDNA 

The cDNA library was prepared from the lanterns 
of locally collected P. pennsyluanica fireflies, ex- 
pressed in E. coli, and screened for light production 
after luciferin addition. Since the screening detected 
expressed bioluminescence, only functional cDNA 
sequences were identified. The insert size as deter- 
mined after EcoRl and Xhol digestion and 1% 
agarose gel electrophoresis was about 1.8 kb which is 
sufficient to code for the entire Iuciferase polypeptide 
of approx. 550 amino-acid residues (based on the 
length of other firefly lucif erases). In a restriction 
enzyme-based analysis, the selected clone did not 
contain a Kpnl restriction site but did have a single 
BamHL site within the luciferase-coding sequence 



(which is not found in the Photinus sequence). No 
Kpnl restriction site is present, in the Photinus se- 
quence. Devine et al. [11] found clones of Luciola 
mingrelica cDNA that had either one or two Kpnl 
sites. 



A. Phylogenetic tree 



Lla 

Ua(g) 

Lcr 

Lmi 

Hpa 

Pmi 

Lno 

Ppy 



27 , Ppe2(KW) 
_[ Ppe2<LY) 



24 r PP«YE) 
2SH_.Ppl(OR) 
26 fl_ Ppl<YG> 
1 Ppi(GH) 



25.7 - 



— !— 
20 



-I 



25 20 15 

B. Biological classification 



Order - beetles 



Artftropoda 
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Insects 
Pterygota 

Coieopterea 
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SucerfamitY Elatenodea 



Family Elateriadae 

click beetles 



Canthraroidea 



Lampyridae 



I 

Subfamily Pyrophorinae Lampyrinae 

H — i 

Tribe Pyrepnorini Photintni Lampyrini 



Photurinae- Luootinae- 
V I 



Luciotini 



Pyrophorus PhoPnus 
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H<* 3 Relationships among the amino-acid residues and the 
biological classification of the sixteen sequenced firefly lu- 
ciferases. (A) The relationship among firefly species based on 
firefly Iuciferase amino acid sequences as determined using the 
protein parsimony and the Clustal algorithm in the DNA Star 
program. The length of each pair represents the distance between 
the sequence pairs and the scale beneath the tree measures the 
aenetic distance between sequences. (B). Biological classification 
of the firefly species from which the. Iuciferase has been se- 
quenced. This classification of firefly species was adapted from 
Herring [29] and Campbell [5]. 
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Fig. 1 shows the nucleotide sequence of the P. 
pennsyluanica cDNA and the deduced amino-acid 
sequence for the largest open reading frame. From 
the sequence analysis, the cDNA is 1831 bp long 
with an open reading frame (ORF) of 1635 bp. The 
ORF encodes a protein of 545 amino acids with a 
calculated molecular weight of 60610 daltons. The 5' 
untranslated region contains 6 1 bp and the 3' untrans- 
lated region has 135 bp. The 3' noncoding region 
contains a poly(A) tail of 24 nucleotides. 

3.2. Comparison of the deduced amino-acid se- 
quences 

The amino-acid sequences deduced from the 16 
cDNAs and genes sequenced for firefly luciferases 
have been aligned (see Fig. 2) to allow determination 
of conserved amino-acid residues and suggest possi- 
ble functional portions. There are 154 residues con- 
served among all the luciferases (about 28% of the 
total residues). In the putative P. pennsyluanica lar- 
val luciferase, the amino acids at 276 positions are 
the same at corresponding positions of at least one 
other species. One hundred and fifteen amino-acid 
residues are unique to the putative P. pennsyluanica 
larval enzymes. Of these, 24 residues are conserved 
in all other species. 

The amino-acid compositions of the sequenced 
firefly luciferases are compared in Table 2. The 
calculated isoelectric points of putative P. pennsyl- 
uanica larval enzymes are the highest of all isozymes. 

3.3. Relatedness 

The amino-acid sequences of the firefly luciferases 
were analyzed by protein parsimony using the DNA 
Star program (Fig. 3A). The phylogenetic classifica- 
tion for these beedes is shown in Fig. 3B [5,29]. As 
expected, the various Luciola species are closely 
related. The relationships among the various species 
as determined by luciferase amino-acid sequence ap- 
pear similar to the relationships based on biological 
classification. The percentage of identity and similar- 
ity were calculated for the 16 firefly luciferase se- 
quences using the GCG program BestFit with a gap 
weight of 3.0 (Table 3). The Ppel and Ppel lu- 
ciferases are 57% identical. 
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4. Discussion 

4.1. Related sequences 

When either the cDNA sequence or the predicted 
amino-acid sequence was used as the input sequence 
for computer-based searches for similarity, the high- 
scoring related sequences were the luciferases from 
both the Lampyridae and the Elateridae families, 
4-coumarate CoA ligase, long-chain CoA ligase, 2- 
acylglycerophosphoethanolamine acyitransferase, and 
the peptide-antibiotic-synthesizing enzymes such as 
gramicidin S synthetase and tyrocidine synthetase. 
These relationships have been reported [30-38] for 
the other firefly luciferases. 

4.2. Domain structure 

A domain structure map for the predicted amino- 
acid sequence of Photuris firefly luciferase was de- 
veloped by using the ProSite, ProDom, PRINTS, 
BLOCKS, and PepPepSearch programs. Fig. 4 illus- 
trates these results and Table 4 defines the sites and 
their presumed functions, if known. 

The T-250TLGYFT-256 sequence (see also Fig. 2) 
is the AMP binding block BL00455B. PS00339, the 
AA tRNA ligase EL2 sequence whose consensus se- 
quence is [GSTALVF]-{DENQHRKP}-[GSTA]-[L1- 
VMF]-[DE]-R-[LIVMF]-x-[LIVMSTAG]-[LrVMFY], 
was found as F-431YIVDRLKSL-440 (correct in 
9/10 positions). The ProSite convention is [ambigui- 
ties where indicated amino acids are acceptable] and 
{amino acids not accepted in this position}. The 
G-338YGLTRYSAVLITPDTDVRPGSTG-362 se- 
quence is the domain H that is conserved in acyl- 
adenylate- synthesizing enzymes [30]. The adenylate 
kinase signature, PS00113, with consensus sequence 
[LIVMFYW](3)-D-G-[FY]-P-R-x(3)-[NQ] was tenta- 
tively found as 1-41 1NKDGWLRSGDI-422 (correct 
in 7/12 positions). The G-415WLRSGD-421 se- 
quence is the domain EH that is conserved in acyl- 
adenylate-synthesizing enzymes [30]. The SKL se- 
quence at the C-terminus is the microbody-directing 
sequence and was detected by the SORT program (it 
is also ProSite PS00342). 

The putative AMP-binding domain signature of 
[LIVMFV]-x(2)-[STG](2)-G-[ST]-[STE]-[SG]-x- 
[PALIYM]-K (ProSite P00455) was found as V- 
1 94MFSS GTTG VS K-205 and is highly conserved 
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Fig. 4. Putative functional domains of Phoruris pennsylvanica 
firefly luciferase. The figure is a composite of the BEAUTY map 
and ProDom. ProSite and Blocks maps of possible functional 
domains. The ProDom sequences are: 3894, luciferin 4-mono- 
oxygenase (firefly luciferase); 170, 4-coumarate CoA ligase; 
3895, luciferin 4-monooxygenase; 184, gramicidin S synthetase; 
3896! luciferin 4-monooxygenase; 937, 4-coumarate CoA ligase; 
10747, 4-coumarate CoA ligase; 10748, 4-coumarate CoA ligase; 
10832, 2acylglycerophosphoethanolamine acyltransferase; 185, 
gramicidin S Synthetase; and 389L luciferin 4-monooxygenase. 
The ProSite sequences are: PS00455, putative AMP-binding do- 
main sequence; PS00113, adenylate kinase signature (similar); 
PS00339, aminoacyl-tRNA synthetase class II.2 (similar); and 
PS00342, microbodies C-terminai targeting sequence. The Blocks 
are: BL00455A and BL00455B, the putative AMP-binding do- 
main. 
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ProDoms'and ProSites found in firefly luciferase 



number in parentheses is the number of occurrences 
in the 16 sequences). 

Among the sequences not found (whose possible 
existence was considered on the basis of similarity in 
function) is the consensus pattern of [FYH]-R-X- 
[DE]-X(4, 12MRH]-X(3)-F-X(3)-[DEL ProSite 
PS00179 which is the amino-acid tRNA ligase EL 1 
pattern. ProSites that were sought but not found 
include: the ATP/GTP-binding site motif A (P-loop) 
PS00017, the chloramphenicol acetyltransferase ac- 
tive-site PS0100, the protein kinase signatures 
(PS00107, PS00108, and PS00109), the ubiquitin- 
activating enzyme signatures (PS00536 and 
PS00865), and the acyl-CoA-binding protein signa- 
ture PS00880. Since the P-loop occurs in many pro- 
teins that bind ATP/GTP, it was the subject of a 
search. Protein kinases have binding sites for. ATP 
and there is some similarity. The CoA binding do- 
mains were sought because CoA influences the time- 
course of lieht production by firefly luciferase [39- 
41]. 

The domains that can be recognized involve 
ATP( AMP) -binding sites, regions that interact-with 
ATP, reaions that are involved in reactions leadings 
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the formation of adenylate intermediates, and func- 
tion in peptide synthetases. As expected, there are 
several regions that are found in other firefly lu- 
ciferases. 

4.3. Evolutionary relationships 

Wood [42] has reviewed the information on chemi- 
cal mechanism and evolutionary development of the 
beede luciferases. The level of dissimilarity among 
the beede luciferases is large; only 27% of the 
amino-acid sequence is conserved among the cloned 
beetle luciferases. Wood concluded that the rate of 
evolution of the luciferases is high relative to other 
enzymes. 

Wood presented a tree diagram showing the rela- 
tionship of the beede luciferases with the CoA syn- 
thetases [42]. The closest enzyme to the luciferases is 
4-coumarate:GoA ligase which has 17% identity in 
amino-acid sequences. Wood [42] postulates that the 
firefly luciferases may have evolved from the CoA 
synthetases. CoA influences the pattern of light pro- 
duction by the luciferases. Without CoA, saturating 
concentrations of ATP produce a flash of light. CoA 
prevents the subsequent inhibition of light production 
and allows a sustained production of light. Wood [40] 
and Ford et ai. [41] found that the -SH group of CoA 
was required for stimulation of activity. Other nu- 
cleotide analogs can produce a steady light produc- 
tion similar to that obtained with CoA [43]. These 
results have been interpreted as an enhanced, turnover 
of the enzyme mediated by conformational changes. 
These observations suggest that there may be a vesti- 
gial CoA binding site on the luciferases. The crystal 
structure of Type III chloramphenicol acetyltrans- 
ferase has been solved at a 1.75 A resolution by 
Leslie [44] and the acetyl CoA binding site analyzed 
[45]. The amino-acid residues interacting with CoA 
are F-55, Y-56, F-104, S-148, W-152, K-177, Y-178, 
H-195, and D-199. A similar pattern of residues 
occurs in P. pennsylvanica and other firefly lu- 
ciferases (the residue number corresponds to the P. 
pennsylvanica sequence and the ratio is the number 
of times that the residue is found in the aligned 
position in the 16 luciferases of known sequence): 
F330, 12/16; Y339, 12/16; F367 6/16 and at posi- 
tion 368, 8/16; W416, 16/16; S419, 11/16 or T, 
5/16; K442, 16/16; Y443, 16/16; H460, 12/16; 



and D465 16/16. These amino acids may constitute 
the CoA-binding site. 

4.4. Crystal structure 

The crystal structure of recombinant P. pyralis 
luciferase has been determined at a resolution of 2.0 
A [46]. There are two compact domains — the 
N-terminal 80% and the C-terminal 20%. The N- 
terminal domain contains a /3-barrei and two /3-sheets 
which are flanked by o:-helices. The C-terminal re- 
gion contains two short antiparallel /3-strands and a 
three-stranded mixed /3-sheet, with three helices 
packed against the side. There is a large cleft between 
the two parts (domains). Amino acids conserved in 
the firefly luciferases are in both parts. The P-loop is 
in a loop connecting antiparallel strands 6 and 7 of 
/3-sheet A. There is a large conformational change 
when substrates are bound and establishment of a 
nonpolar environment would insure high quantum 
yield. Conti et al. [46] suggest- that the C-terminal 
region moves to form a cap. The amino-acid residues 
that we suggest^ might be involved in the interaction 
with CoA are found on both sides of the cleft. There 
are two peptide sequences that are labeled with ra- 
dioactive thiourea dioxide, a lysine-reagent [14]. 
Those two peptides are positioned to be part of the 
active site — one in the C-terminal domain and the 
other in the N-terminal domain on faces of the cleft 
that- could meet. Tne Y-339 and S-419 {Phomris 
numbering) suggested above are predicted to be a 
part of the P. pyralis active site containing (P. 
pyralis sequence numbers) S-198, K-206, Y-340, E- 
344, E-389, Y-401, S-420, G-421, and D-422 [46], 

Baldwin [47], in a review of the firefly luciferase 
structure that reveals a new protein fold, concludes 
that 'the mystery remains' as to the molecular mech- 
anism of catalysis. The crystals were obtained with- 
out bound substrates or other ligands and the sug- 
gested active site is based on those amino-acid 
residues that are conserved among a superfamily of 
adenylate -forming enzymes. 

Elucidation of the amino-acid residues involved in 
catalyzing the firefly luciferase reactions awaits the 
results of specific biochemical modification experi- 
ments, additional site-directed mutagenesis studies, 
and further X-ray crystallographic analysis of the 
protein structure with bound reactants.. The large 
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conformational changes that occur during catalysis 
suggest that the dynamics of these changes are impor- 
tant in understanding the mechanism. Firefly lu- 
ciferase differs from the adenylate-forming enzymes 
used, in the structural comparisons described, above 
because luciferase is a monooxygenase and there 
must be conserved amino-acid residues for that activ- 
ity. The tools are rapidly becoming available for a 
better understanding of this biohiminescent process. 
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